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IN THE CLAIMS 

Please amend the claims as follows: 

1 . (Currently Amended) In a computer system having a plurality of processors connected to 
a shared memory, a method of decoupling an a write address from its corresponding write data in 
a store to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in the shared memory; 

issuing a write request to the shared memory, wherein the write request includes the write 
request address; 

noting the write request address in the shared memory; 

comparing, in the shared memory, addresses in subsequent load and store requests to the 
write request address; 

transferring the write data to the shared memory; 

matching, within the shared memory, the write request address to the its corresponding 
write data; and 

storing the write data into the shared memory as a function of the write request address. 

2. (Original) The method according to claim 1 , wherein the shared memory includes a store 
address buffer and wherein noting the write request address includes writing the address in the 
store address buffer. 

3. (Currently Amended) The method according to claim 2, wherein comparing addresses in 
subsequent read and write load and store requests includes stalling subsequent read requests to 
the write request address until the write data is written into the shared memory. 



4. (Original) The method according to claim 1, wherein the shared memory includes a 
cache, wherein noting the write request address includes changing a state in a cache line 
associated with the write request address to "WaitForData", and wherein comparing addresses in 
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subsequent load and store requests to the write request address includes accessing the cache and 
stalling if a cache line hit returns a "WaitForData" state. 

5. (Currently Amended) The method according to claim 1, wherein the shared memory 
includes a bit vector, wherein noting the write request address in the shared memory includes 
setting one or more bits in the bit vector corresponding to the write request address, and wherein 
comparing addresses in subsequent load and store requests to the write request address includes 
comparing bits that would be set corresponding to the load and store request addresses with the 
bits set for the write request address and stalling servicing of the load and store requests if there 
is a match. 

6. (Currently Amended) The method according to claim 1 , wherein comparing addresses in 
subsequent read and write load and store requests includes stalling the subsequent read load 
requests to the write request address until the write data is written into the shared memory. 

7. (Currently Amended) The method according to claim 6, wherein comparing addresses in 
subsequent read and write load and store requests includes servicing the load and store requests 
to addresses other than the write request address without waiting for the write data to be written 
to the write request address. 

8. (Currently Amended) The method according to claim 1, wherein comparing addresses in 
subsequent read and write load and store requests includes servicing the load and store requests 
to addresses other than the write request address without waiting for the write data to be written 
to the write request address. 

9. (Currently Amended) The method according to claim 1, wherein comparing addresses in 
subsequent read and write load and store requests includes enforcing memory ordering in 
subsequent read and write requests to the write request address until the write data associated 
with the first write request is written into the shared memory. 
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10. (Currently Amended) The method according to claim 1, wherein issuing a write request 
includes ensuring that all vector and scalar loads from shared memory for that processor have 
been sent to the shared memory prior to issuing the write request. 

1 1 . (Currently Amended) In a computer system having a plurality of processors connected to 
a shared memory, a method of decoupling an a write address from its corresponding write data in 
a write to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in shared memory; 

issuing a first write request to the shared memory, wherein the first write request includes 
the write request address; 

noting the write request address in the shared memory; 

comparing, in the shared memory, addresses in subsequent read and write requests to the 
write request address; 

stalling the subsequent read requests to the write request address until the write data 
corresponding to the first write request is written into the shared memory; and 

if the address in a subsequent write request matches the write request address stored in 
the shared memory and there are no stalled read requests to the write request address, discarding 
the first write request. 

12. (Original) The method according to claim 1 1 , wherein the shared memory includes a 
store address buffer and wherein noting the write request address includes writing the address in 
the store address buffer. 

13. (Original) The method according to claim 12, wherein comparing addresses in 
subsequent read and write requests includes stalling subsequent read requests to the write request 
address until the write data is written into the shared memory. 



14. (Currently Amended) The method according to claim 1 1 , wherein the shared memory 
includes a cache, wherein noting the write request address includes changing a state in a cache 
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line associated with the write request address to "WaitForData", and wherein comparing 
addresses in subsequent load and store read and write requests to the write request address 
includes accessing the cache and stalling if a cache line hit returns a "WaitForData" state. 

1 5 . (Currently Amended) The method according to claim 1 1 , wherein the shared memory 
includes a bit vector, wherein noting the write request address in the shared memory includes 
setting one or more bits in the bit vector corresponding to the write request address, and wherein 
comparing addresses in subsequent load and store read and write requests to the write request 
address includes comparing bits that would be set corresponding to the load and store request 
addresses the bits set for the write request address and stalling servicing of the load and store 
requests if there is a match. 

16. (Currently Amended) The method according to claim 11, wherein comparing addresses 
in subsequent read and write requests includes stalling the subsequent read requests to the write 
request address until the write data is written into the shared memory. 

17. (Currently Amended) The method according to claim 16, wherein comparing addresses 
in subsequent read and write requests includes servicing the load and store read and write 
requests to addresses other than the write request address without waiting for the write data to be 
written to the write request address. 

18. (Currently Amended) The method according to claim 11, wherein comparing addresses 
in subsequent read and write requests includes servicing the load and s tore read and write 
requests to addresses other than the write request address without waiting for the write data to be 
written to the write request address. 

19. (Currently Amended) The method according to claim 11, wherein comparing addresses 
in subsequent read and write requests includes enforcing memory ordering in the subsequent 
read and write requests to the write request address until the write data associated with the first 
write request is written into the shared memory. 
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20. (Currently Amended) The method according to claim 11, wherein issuing a write request 
includes ensuring that all vector and scalar loads from shared memory for that processor have 
been sent to the shared memory prior to issuing the write request. 

21 . (Currently Amended) In a computer system having a plurality of processors connected to 
a shared memory, a method of decoupling an a write address from its corresponding write data in 
a store to the shared memory, comprising: 

generating a write request address for a vector store to memory, wherein the write request 
address points to a memory location in the shared memory; 

issuing a vector store request to the shared memory, wherein the write request includes 
the write request address; 

noting the write request address in the shared memory; 

comparing, in the shared memory, addresses in subsequent load and store requests to the 
write request address; 

transferring the write data from a vector register to the shared memory; 

matching, within the shared memory, the write request address to the its corresponding 
write data; and 

storing the write data into the shared memory as a function of the write request address. 

22. (Original) The method according to claim 21, wherein the shared memory includes a 
store address buffer and wherein noting the write request address includes writing the address in 
the store address buffer. 

23. (Currently Amended) The method according to claim 22, wherein comparing addresses 
in subsequent read and write load and store requests includes stalling the subsequent read load 
requests to the write request address until the write data is written into the shared memory. 



24. (Original) The method according to claim 2 1 , wherein the shared memory includes a 
cache, wherein noting the write request address includes changing a state in a cache line 



AMENDMENT AND RESPONSE UNDER 37 CFR §1.111 Page 9 

Serial Number: 10/643,742 Dkt: 1376.697US1 

Filing Date: August 18, 2003 

Title: DECOUPLED STORE ADDRESS AND DATA IN A MULTIPROCESSOR SYSTEM 

associated with the write request address to "WaitForData", and wherein comparing addresses in 
subsequent load and store requests to the write request address includes accessing the cache and 
stalling if a cache line hit returns a "WaitForData" state. 

25. (Currently Amended) The method according to claim 21, wherein the shared memory 
includes a bit vector, wherein noting the write request address in the shared memory includes 
setting one or more bits in the bit vector corresponding to the write request address, and wherein 
comparing addresses in subsequent load and store requests to the write request address includes 
comparing bits that would be set corresponding to the load and store request addresses with the 
bits set for the write request address and stalling servicing of the load and store requests if there 
is a match. 

26. (Currently Amended) The method according to claim 21 , wherein comparing addresses 
in subsequent read and writ e load and store requests includes stalling the subsequent read load 
requests to the write request address until the write data is written into the shared memory. 

27. (Currently Amended) The method according to claim 26, wherein comparing addresses 
in subsequent r e ad and write load and store requests includes servicing the load and store 
requests to addresses other than the write request address without waiting for the write data to be 
written to the write request address. 

28. (Currently Amended) The method according to claim 21 , wherein comparing addresses 
in subsequent read and write load and store requests includes servicing the load and store 
requests to addresses other than the write request address without waiting for the write data to be 
written to the write request address. 

29. (Currently Amended) The method according to claim 21, wherein comparing addresses 
in subsequent read and write load and store requests includes enforcing memory ordering in the 
subsequent r e ad and write load and store requests to the write request address until the write data 
associated with the first write request is written into the shared memory. 
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30. (Currently Amended) The method according to claim 21, wherein issuing a write request 
includes ensuring that all vector and scalar loads from shared memory for that processor have 
been sent to the shared memory prior to issuing the write request. 

3 1 . (Currently Amended) A method of decoupling vector data stores from vector instruction 
execution, comprising: 

executing a vector instruction on vector data stored in a vector register, wherein 
executing a vector instruction includes storing result vector data in [[a]] the vector register; 
generating a vector write address for a vector store; 

issuing a vector store request to memory, wherein the vector store request includes the 
vector write address; 

noting the vector write address in the memory; 

comparing, in the memory, addresses in subsequent read and write requests to the vector 
write address; 

transferring result vector data from the vector register to the memory; 
matching the vector store requ e st write address and its corresponding result vector data in 
the memory; and 

storing the result vector data into the memory as a function of the address in the vector 
store request. 

32. (Currently Amended) The method according to claim 31, wherein matching includes 
comparing addresses in subsequent read and write requests to the vector write address and 
includes stalling the subsequent read requests to the vector write address until the result vector 
data is written into the memory. 



33. (Canceled) 
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34. (Currently Amended) In a processor having a plurality of processing units connected to a 
shared memory, a method of decoupling an a write address from its corresponding write data in a 
write to the shared memory, comprising: 

generating a write request address for a memory write, wherein the write request address 
points to a memory location in the shared memory; 

issuing a write request to the shared memory, wherein the write request includes the write 
request address; 

storing the write request address in the shared memory; 

comparing addresses in subsequent read and write requests to the write request address 
stored in the shared memory; 

transferring the write data to the shared memory; 

matching, within the shared memory, the write request address to the its corresponding 
write data; and 

storing the corresponding write data into the shared memory as a function of the write 
request address. 

35. (Currently Amended) The method according to claim 34, wherein issuing a write request 
includes ensuring that all vector and scalar loads from shared memory for that processor have 
been sent to the shared memory prior to issuing the write request. 

36. (Currently Amended) The method according to claim 34, wherein comparing addresses 
in subsequent read and write requests includes stalling the subsequent read requests to the write 
request address until the write data is written into the shared memory. 

37. (Currently Amended) The method according to claim 34, wherein comparing addresses 
in subsequent read and write requests includes enforcing memory ordering in the subsequent 
read and write requests to the write request address until the write data associated with the first 
write request is written into the shared memory. 
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38. (Currently Amended) A computer system, comprising: 

a plurality of processors, wherein the processors includes means for issuing a write 
address separate from data to be written to the write address; and 

a shared memory connected to the plurality of processors, wherein the shared memory 
includes: 

means for receiving a first write request including a first write address; and 
means for comparing addresses in subsequent load and store requests to the first 
write address; and 

means for stalling subsequent loads and stores load and store requests to a 
memory location in the shared memory associated with the first write address in shared 
memory until the data to be written to the write address associated with the first write 
request is received and written by the shared memory. 



